RAW SEQUENCE LISTING DATE: 05/07/2002 

PATENT APPLICATION: US/10/086,208 TIME: 11:26:07 

Input Set : N:\Crf3\RULE60\10086208.raw 
Output Set: N:\CRF3\05072002\J086208.raw 

SEQUENCE LISTING 

(1) GENERAL INFORMATION:

(i) APPLICANT: Goldstein, Gideon 
(ii) TITLE OF INVENTION: Methods and Compositions for Impairing Multiplication of HIV-1
(iii) NUMBER OF SEQUENCES: 124 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr., P.O. Box 457

(C) CITY: Spring House 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19477 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30
(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US/10/086,208

(B) FILING DATE: 28-Feb-2002 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA:

(B) REGISTRATION NUMBER: 31,215

(C) REFERENCE/DOCKET NUMBER: GGP2AUSA 
(ix) TELECOMMUNICATION INFORMATION:

(B) TELEFAX: 215-540-5818
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser

64 1 5 10 15

66 Gln Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys

67 20 25 30

69 His Cys Gln Val Cys Phe Thr Thr Lys Gly Leu Gly Ile Ser Tyr

70 35 40 45

72 Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala Pro Gln Asp Ser Gln

73 50 55 60

75 His Gln Val Ser Leu Ser Lys Gln

76 65 70
78 (2) INFORMATION FOR SEQ ID NO: 2:

80 (i) SEQUENCE CHARACTERISTICS:

81 (A) LENGTH: 912 base pairs

82 (B) TYPE: nucleic acid

83 (C) STRANDEDNESS: double

84 (D) TOPOLOGY: linear
86 (ii) MOLECULE TYPE: cDNA

89 (ix) FEATURE:

90 (A) NAME/KEY: CDS

91 (B) LOCATION: join(1..876, 883..912)

94 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

96 GAG CTC TAC AAA TCC GGG GAT CCG GGT GAA GAT CCG CGT TTA GAG CCG 48

97 Glu Leu Tyr Lys Ser Gly Asp Pro Gly Glu Asp Pro Arg Leu Glu Pro

98 1 5 10 15

100 TGG AAA CAC CCG GGT TCT GGT TCT GTT GAC CCT AAC CTT GAA CCT TGG 96

101 Trp Lys His Pro Gly Ser Gly Ser Val Asp Pro Asn Leu Glu Pro Trp

102 20 25 30

104 AAG CAT CCT GGC AGC TCC GGA GTC GAT CCC AAA CTC GAG CCC TGG AAA 144

105 Lys His Pro Gly Ser Ser Gly Val Asp Pro Lys Leu Glu Pro Trp Lys

106 35 40 45

108 CAC CCC GGA AGT TCG GGG GTA GAC CCA TCT CTG GAA CCA TGG AAG CAT 192

109 His Pro Gly Ser Ser Gly Val Asp Pro Ser Leu Glu Pro Trp Lys His

110 50 55 60

112 CCA GGG AGT GGT AGC GTG AAT CCG TCA TTA GAG CCG TGG AAA CAC CCG 240

113 Pro Gly Ser Gly Ser Val Asn Pro Ser Leu Glu Pro Trp Lys His Pro

114 65 70 75 80

116 GGT TCA TCT GGA GTT GAT CCT CGC TTG GAA CCT TGG GAG CAT CCT GGT 288

117 Gly Ser Ser Gly Val Asp Pro Arg Leu Glu Pro Trp Glu His Pro Gly

118 85 90 95

120 TCG TCC GGT GTA GAC CCC CGA CTT GAG CCC TGG AAT CAC CTC GGG AGT 336

121 Ser Ser Gly Val Asp Pro Arg Leu Glu Pro Trp Asn His Leu Gly Ser

122 100 105 110

124 TCA GGC GTA GAT CAT CGG CTC GAA CCA TGG AAA CAT CCA GGT TCT GGA 384

125 Ser Gly Val Asp His Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gly

126 115 120 125

128 GAT CTG CGC CAG CGG CGA CGT ACT CCT CAG GAT TCT GGA TCT CGA CAA 432

129 Asp Leu Arg Gln Arg Arg Arg Thr Pro Gln Asp Ser Gly Ser Arg Gln

130 130 135 140

132 CGT CGG CGC CCT CCC CAA GAC TCC TCA GGA CGG CAG CGC CGA CGA CCC 480

133 Arg Arg Arg Pro Pro Gln Asp Ser Ser Gly Arg Gln Arg Arg Arg Pro

134 145 150 155 160

136 CCA CAG GGT TCA GGT TCA CGT CAA CGA CGC GGT CCA CCC CAA GGC TCG 528

137 Pro Gln Gly Ser Gly Ser Arg Gln Arg Arg Gly Pro Pro Gln Gly Ser

138 165 170 175

140 GGT TCG CGC CAG CGG CGA CGT CCG CCT CAG AAC TCT AGT GGA CGA CAA 576

141 Gly Ser Arg Gln Arg Arg Arg Pro Pro Gln Asn Ser Ser Gly Arg Gln

142 180 185 190

144 CGT CGG CGC TCT CCC CAA GAT TCC GGC GGG CGG CAG CGC CGT CGA TCA 624

145 Arg Arg Arg Ser Pro Gln Asp Ser Gly Gly Arg Gln Arg Arg Arg Ser

146 195 200 205

148 CCA CAG AAC TCA GGT GGG CGT CAA CGA CGC CGG ACT CCG CAA TCT TCA 672

149 Pro Gln Asn Ser Gly Gly Arg Gln Arg Arg Arg Thr Pro Gln Ser Ser

150 210 215 220

152 TCC GGC CGC CAG CGG CGA CGT GCC CAT CAG AAT AGC GGC AGC CGA CAA 720

153 Ser Gly Arg Gln Arg Arg Arg Ala His Gln Asn Ser Gly Ser Arg Gln

154 225 230 235 240

156 CGT CGG CGC GCA CAC CAA GAC AGC AGT GGG CGG CAG CGC CGT CGA GCG 768

157 Arg Arg Arg Ala His Gln Asp Ser Ser Gly Arg Gln Arg Arg Arg Ala

158 245 250 255

160 CCT GAA GAT AGT GGT TCT CGT CAA CGA CGC CGG GCT CCC CCT GAC AGC 816

161 Pro Glu Asp Ser Gly Ser Arg Gln Arg Arg Arg Ala Pro Pro Asp Ser

162 260 265 270

164 TCC GGA CGC CAG CGG CAA CGT GCA CCA GAT AGT TCC TCA GGT CAT CAC 864

165 Ser Gly Arg Gln Arg Gln Arg Ala Pro Asp Ser Ser Ser Gly His His

166 275 280 285

168 CAC CAT CAT CAC TAATAA GAA TTC GGA TCC TCT AGA GTC GAC AAG CTT 912 

169 His His His His Glu Phe Gly Ser Ser Arg Val Asp Lys Leu 

170 290 295 300 
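
As a cross-check on the CDS feature of SEQ ID NO: 2, the following is a minimal Python sketch that translates only the segments named in a join(...) location. The function name `translate_join` and the re-indexing of the final row so that it starts at position 1 are illustrative assumptions, not part of the listing; the codon table is the standard genetic code, and the output uses one-letter amino acid codes.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_join(dna, segments):
    """Translate the coding segments of `dna` (hypothetical helper).

    `segments` holds (start, end) pairs that are 1-based and inclusive,
    mirroring the join(start..end, ...) notation of a CDS location.
    """
    sequence = "".join(dna.split()).upper()
    coding = "".join(sequence[start - 1:end] for start, end in segments)
    if len(coding) % 3 != 0:
        raise ValueError("coding length is not a multiple of three")
    return "".join(CODON_TABLE[coding[i:i + 3]] for i in range(0, len(coding), 3))


# Final row of SEQ ID NO: 2 (nucleotides 865..912), re-indexed from 1 (assumption).
# In these local coordinates, join(1..876, 883..912) becomes join(1..12, 19..48).
last_row = "CAC CAT CAT CAC TAATAA GAA TTC GGA TCC TCT AGA GTC GAC AAG CTT"
print(translate_join(last_row, [(1, 12), (19, 48)]))  # HHHHEFGSSRVDKL
```

The six untranslated nucleotides (TAATAA) fall between the two joined segments, which is why the printed translation of that row runs directly from the four His residues to Glu.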
173 (2) INFORMATION FOR SEQ ID NO: 3:

175 (i) SEQUENCE CHARACTERISTICS: 

176 (A) LENGTH: 302 amino acids 

177 (B) TYPE: amino acid 

178 (D) TOPOLOGY: linear 
180 (ii) MOLECULE TYPE: protein 

182 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



184 Glu Leu Tyr Lys Ser Gly Asp Pro Gly Glu Asp Pro Arg Leu Glu Pro

185 1 5 10 15

187 Trp Lys His Pro Gly Ser Gly Ser Val Asp Pro Asn Leu Glu Pro Trp

188 20 25 30

190 Lys His Pro Gly Ser Ser Gly Val Asp Pro Lys Leu Glu Pro Trp Lys

191 35 40 45

193 His Pro Gly Ser Ser Gly Val Asp Pro Ser Leu Glu Pro Trp Lys His

194 50 55 60

196 Pro Gly Ser Gly Ser Val Asn Pro Ser Leu Glu Pro Trp Lys His Pro

197 65 70 75 80

199 Gly Ser Ser Gly Val Asp Pro Arg Leu Glu Pro Trp Glu His Pro Gly

200 85 90 95

202 Ser Ser Gly Val Asp Pro Arg Leu Glu Pro Trp Asn His Leu Gly Ser

203 100 105 110

205 Ser Gly Val Asp His Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gly

206 115 120 125

208 Asp Leu Arg Gln Arg Arg Arg Thr Pro Gln Asp Ser Gly Ser Arg Gln

209 130 135 140

211 Arg Arg Arg Pro Pro Gln Asp Ser Ser Gly Arg Gln Arg Arg Arg Pro

212 145 150 155 160

214 Pro Gln Gly Ser Gly Ser Arg Gln Arg Arg Gly Pro Pro Gln Gly Ser

215 165 170 175

217 Gly Ser Arg Gln Arg Arg Arg Pro Pro Gln Asn Ser Ser Gly Arg Gln

218 180 185 190

220 Arg Arg Arg Ser Pro Gln Asp Ser Gly Gly Arg Gln Arg Arg Arg Ser

221 195 200 205

223 Pro Gln Asn Ser Gly Gly Arg Gln Arg Arg Arg Thr Pro Gln Ser Ser

224 210 215 220

226 Ser Gly Arg Gln Arg Arg Arg Ala His Gln Asn Ser Gly Ser Arg Gln

227 225 230 235 240

229 Arg Arg Arg Ala His Gln Asp Ser Ser Gly Arg Gln Arg Arg Arg Ala

230 245 250 255

232 Pro Glu Asp Ser Gly Ser Arg Gln Arg Arg Arg Ala Pro Pro Asp Ser

233 260 265 270

235 Ser Gly Arg Gln Arg Gln Arg Ala Pro Asp Ser Ser Ser Gly His His

236 275 280 285

238 His His His His Glu Phe Gly Ser Ser Arg Val Asp Lys Leu

239 290 295 300
241 (2) INFORMATION FOR SEQ ID NO: 4: 

243 (i) SEQUENCE CHARACTERISTICS:

244 (A) LENGTH: 9 amino acids

245 (B) TYPE: amino acid

246 (C) STRANDEDNESS:

247 (D) TOPOLOGY: linear
249 (ii) MOLECULE TYPE: peptide

254 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

256 Arg Lys Lys Arg Arg Gln Arg Arg Arg

257 1 5 

259 (2) INFORMATION FOR SEQ ID NO: 5:

261 (i) SEQUENCE CHARACTERISTICS: 

262 (A) LENGTH: 6 amino acids 

263 (B) TYPE: amino acid 

264 (C) STRANDEDNESS: 

265 (D) TOPOLOGY: linear 
267 (ii) MOLECULE TYPE: peptide 

272 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

274 Gly Arg Gly Asp Ser Pro 

275 1 5 

277 (2) INFORMATION FOR SEQ ID NO: 6:

279 (i) SEQUENCE CHARACTERISTICS: 

280 (A) LENGTH: 7 amino acids 

281 (B) TYPE: amino acid 

282 (C) STRANDEDNESS:

283 (D) TOPOLOGY: linear
285 (ii) MOLECULE TYPE: peptide

290 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

292 Val Asp Pro Arg Leu Glu Pro 

293 1 5 

295 (2) INFORMATION FOR SEQ ID NO: 7:

297 (i) SEQUENCE CHARACTERISTICS: 

298 (A) LENGTH: 7 amino acids 

299 (B) TYPE: amino acid 

300 (C) STRANDEDNESS:

301 (D) TOPOLOGY: linear 
303 (ii) MOLECULE TYPE: peptide 

308 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

310 Val Asp Pro Lys Leu Glu Pro 

311 1 5 

313 (2) INFORMATION FOR SEQ ID NO: 8: 

315 (i) SEQUENCE CHARACTERISTICS: 

316 (A) LENGTH: 7 amino acids 

317 (B) TYPE: amino acid 

318 (C) STRANDEDNESS: 

319 (D) TOPOLOGY: linear 
321 (ii) MOLECULE TYPE: peptide 

326 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

328 Val Asp Pro Ser Leu Glu Pro 

329 1 5 

331 (2) INFORMATION FOR SEQ ID NO: 9:

333 (i) SEQUENCE CHARACTERISTICS:

334 (A) LENGTH: 7 amino acids 

335 (B) TYPE: amino acid 

336 (C) STRANDEDNESS: 

337 (D) TOPOLOGY: linear 
339 (ii) MOLECULE TYPE: peptide 

344 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

346 Val Asp Pro Asn Leu Glu Pro

347 1 5

349 (2) INFORMATION FOR SEQ ID NO: 10:

351 (i) SEQUENCE CHARACTERISTICS: 

352 (A) LENGTH: 6 amino acids 

353 (B) TYPE: amino acid 

354 (C) STRANDEDNESS: 

355 (D) TOPOLOGY: linear 

357 (ii) MOLECULE TYPE: peptide

360 (ix) FEATURE:

361 (A) NAME/KEY: Binding-site

362 (B) LOCATION: 6 

363 (D) OTHER INFORMATION: /note= "an amide is attached to the 

364 Ser in position 6" 

367 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
369 Trp Lys His Pro Gly Ser

Please Note:

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:36; Xaa Pos. 1,5,9
Seq#:37; Xaa Pos. 1,3,13
Seq#:38; Xaa Pos. 1,4,5,6,7
Seq#:39; Xaa Pos. 1,4,7,14
Seq#:58; Xaa Pos. 4
Seq#:86; Xaa Pos. 4
Seq#:87; Xaa Pos. 2
Seq#:88; Xaa Pos. 3,4,5,6
Seq#:89; Xaa Pos. 3,6
Seq#:110; Xaa Pos. 3,6
Seq#:124; Xaa Pos. 6

or invalid format, [(A) APPLICATION NUMBER: ]
or invalid format, [(B) FILING DATE:]
875 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:36 after pos.:0
926 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:37 after pos.:0
988 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:38 after pos.:0
1044 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:39 after pos.:0
1400 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:58 after pos.:0
1928 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:86 after pos.:0
1951 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:87 after pos.:0
1992 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:88 after pos.:0
2021 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:89 after pos.:0
2453 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:110 after pos.:0
2720 M:341 W (46) "n" or "Xaa" used, for SEQ ID#:124 after pos.:0